CDS
Accession Number | TCMCG064C17479 |
gbkey | CDS |
Protein Id | XP_020551052.1 |
Location | complement(join(4921001..4921264,4921616..4921750,4921826..4922426,4922804..4923063,4923143..4923480,4923619..4923955,4925209..4925513,4925688..4925754)) |
Gene | LOC105166266 |
GeneID | 105166266 |
Organism | Sesamum indicum |
Protein
Length | 768aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA268358 |
db_source | XM_020695393.1 |
Definition | THO complex subunit 5B isoform X2 [Sesamum indicum] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGATTAATAATAGAGTACTAATGGTTCACCAGGAAGGACGATGTAATTATAATATAACTACAGTAGTGCTCTTGAAACAGGCAAATAGGTCTATCTTGCTTGAAGAAGACAGAGTAAAAGCAGATACTGAACGCGCTAAAGCACCTGTGGACCTCACAACCTTGCAGCTCCATAATTTGATGTATGAGAAAAATCACTATGTTAAAGCAATAAAAGCTTGCAAAGACTTTAAGACTAAATATCCTGATATTGAACTTGTACCCGAGGAAGAATTCCTCAGAGATGCCCCAGAAGACATTAAAAGCTCCACATTATCAACTGACAGTGCGCATGATTTGATGCTTAAAAGGCTCAACTATGAGCTTTTCCAGCGCAAAGAATTATGCAAGCTTCGTGATAAGTTGGAACTACAAAGAAAAGCTCTCGAAGAGACAATTGCTAACAGGAAAAAGTTCTTATCAAGTCTCCCTTCACACCTCAAAGCTCTCAAAAAGGCATCCTTGCCTGTGCAACATCAGTTGGGGCTTCTGCATACCAAGAAACTAAAGCAGCAGCAATTAGCAGAGTTGCTCCCACCTCCTCTCTACATAATCTACTCTCAGTTACTTGCTCAGAAGGAAGCGTTTGGAGAGAATATTGAACTGGAGATTGCAGGAAGTGTAAAGGATGCACAGGCTTTTGCGCGCCAGCTTGCAAATAAGGACTCTGCTATATTAACAAACTTAGAGAATTCCAAGTTGGAAGATGATGTGCCTGATGAGGAAGACGATGGTCAAAGGAGGAGAAAGCGGCCAAAGAAGGTTCTAAGCAAGGATAACCATGACCAGTCTGGAATATATCAAAGTCATCCTCTTAAAGTTTCCCTCCACATAAGTGATGATGAAGCTTCGGACTTGAACTCAGCAAAACTCATCTCCTTGAAGTTTGAGTTCTTAATAAAGTTGAATGTTGTGTGTGTAGGAGTAGAAGGCTCTGAAGAAGATCCTCAAAACAATATCTTGTGCAACTTATTTCCTGATGACACTGGCCTTGAGCTCCCTCTGCAGTCAGCAAAGCTCTGGATTGGCAATTCTTTTTCATTTGATGATAGGCGAACTTCACGGCCTTACAAATGGGTCCAGCATTTGGCAGGAATTGATGTCTTGCCAGAGGTTTCGCCACTGATTTCAGTCTCTGGAGACTCTAATAGTGAGACTACTAGACACGGTTCTGTTCTGTCAGGTCTGTCATTATATCGTCAGCAGAACAGAGTGCAGACAGTTGTGCAAAGGATTTGTGCTCGTAAAAAGGCTCAGCTGGCTCTTGTGGAGTTACTTGATTCGCTAAGGAAGCTTACTTGGCCTACTTTTACCTGTGAAAGCGTTCCATGGGCTTCATACACTCCACACTGCAATTTGCATGGCTGGCTATCCATGACTTCAGCTGGTAACAGTACTACATCTCTGCCACTGGTTGATGCAGAACAGAGTCAGGGTCCTACAAGTGTCAATGCAGATAGAAACTCTGGTAGGTCCAAGGAGATGGAGACCACAACAGAAGATGGGGAGCTTCCATCTTTGGTTCCAGTTGCTAATGGTGTAAATGATGTTGGACTCACCCCCACAAAAGGATCTGAACTTGAGAATTCCAGAAGGCTGAGTTTGATTTCAAAAAGTATCATGTCCCCAATCAACAAGGGGAAGTCACCAAGTTTTAAGAAGCTTGAGGAGGATGTTGATCTCATGCTGGAATCTGATAATGAGCTTGATGAACCAGTTAAAGTGGAGGAAACATCTGATAATGCATCACCATTGGGAGAACTAGCATTTGTTGACAATTCATGGGCGGACTGTGGGGTTCAAGAATACAGTCTTGTACTAACTCGTAGGTTGGACAATGATGACAGGATTATGAAATTGGAAGCCAAGATCAAAATAAGCACAGAATATCCTCTTAGGCCTCCTCATTTTGGACTGAGTCTTTATAGTTCCTCACAAGGAGAGAACTACTTCGTGTCTAATGGTTCGAGGTGGTACAATGAACTTCGTGCAATGGAGGCAGAGGTCAATGTTCACATAATAAGGATGATACCGTTCGATCAAGAAAATTTAATTCTAGGTCATCAAGTGCTTTGCCTTGCGATGCTGTTTGACTTCTTCGTGGATGATGGGAATCCTTCTGAGAAGCGAAGGTCTACTTCAGTGATTGATGTTGGTTTATGCAAGCCTGTAAGTGGAAGGCTTGTCAGCCGATCTTTTAGAGGTCGGGATCGTAGGAAAATGATTTCATGGAAAGACAACACCTGCACTCCTGGTTATCCTTACTAG |
Protein: MINNRVLMVHQEGRCNYNITTVVLLKQANRSILLEEDRVKADTERAKAPVDLTTLQLHNLMYEKNHYVKAIKACKDFKTKYPDIELVPEEEFLRDAPEDIKSSTLSTDSAHDLMLKRLNYELFQRKELCKLRDKLELQRKALEETIANRKKFLSSLPSHLKALKKASLPVQHQLGLLHTKKLKQQQLAELLPPPLYIIYSQLLAQKEAFGENIELEIAGSVKDAQAFARQLANKDSAILTNLENSKLEDDVPDEEDDGQRRRKRPKKVLSKDNHDQSGIYQSHPLKVSLHISDDEASDLNSAKLISLKFEFLIKLNVVCVGVEGSEEDPQNNILCNLFPDDTGLELPLQSAKLWIGNSFSFDDRRTSRPYKWVQHLAGIDVLPEVSPLISVSGDSNSETTRHGSVLSGLSLYRQQNRVQTVVQRICARKKAQLALVELLDSLRKLTWPTFTCESVPWASYTPHCNLHGWLSMTSAGNSTTSLPLVDAEQSQGPTSVNADRNSGRSKEMETTTEDGELPSLVPVANGVNDVGLTPTKGSELENSRRLSLISKSIMSPINKGKSPSFKKLEEDVDLMLESDNELDEPVKVEETSDNASPLGELAFVDNSWADCGVQEYSLVLTRRLDNDDRIMKLEAKIKISTEYPLRPPHFGLSLYSSSQGENYFVSNGSRWYNELRAMEAEVNVHIIRMIPFDQENLILGHQVLCLAMLFDFFVDDGNPSEKRRSTSVIDVGLCKPVSGRLVSRSFRGRDRRKMISWKDNTCTPGYPY |